A Reinforcement Learning Architecture that Transfers Knowledge between Skills when Solving Multiple Tasks

نویسندگان

  • Paolo Tommasino
  • Daniele Caligiore
  • Marco Mirolli
  • Gianluca Baldassarre
چکیده

When humans learn several skills to solve multiple tasks, they exhibit an extraordinary capacity to transfer knowledge between them. We present here the last enhanced version of a bio-inspired reinforcementlearning modular architecture able to perform skill-to-skill knowledge transfer and called ‘TERL Transfer Expert Reinforcement Learning model’. TERL architecture is based on a reinforcement-learning actorcritic model where both the actor and the critic have a hierarchical structure, inspired by the mixture-of-experts model, formed by a gating network that selects experts specialising in learning the policies or value functions of different tasks. A key feature of TERL is the capacity of its gating networks to accumulate, in parallel, evidence on the capacity of experts to solve the new tasks so as to increase the responsibility for action of the best ones. A second key feature is the use of two different responsibility signals for the experts’ functioning and learning: this allows the training of multiple experts for each task so that some of them can be later recruited to solve new tasks and ∗P. Tommasiono is with the Nanyang Technological University, Robotics Research Centre, Singapore. E-mail: [email protected] †D. Caligiore, M. Mirolli, and G. Baldassarre are with the Laboratory of Computational Embodied Neuroscience (LOCEN), Istituto di Scienze e Tecnologie della Cognizione (ISTC), Consiglio Nazionale delle Ricerce (CNR), Roma, Italy. E-mail: {daniele.caligiore,marco.mirolli,gianluca.baldassarre}@istc.cnr.it ‡Manuscript received ; revised .

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Deep Hierarchical Approach to Lifelong Learning in Minecraft

The ability to reuse or transfer knowledge from one task to another in lifelong learning problems, such as Minecraft, is one of the major challenges faced in AI. Reusing knowledge across tasks is crucial to solving tasks efficiently with lower sample complexity. We provide a Reinforcement Learning agent with the ability to transfer knowledge by learning reusable skills, a type of temporally ext...

متن کامل

Autonomous Hierarchical Skill Acquisition in Factored MDPs

Learning hierarchies of reusable skills is essential for efficiently solving multiple tasks in a given domain. Understanding the causal relationships between one’s actions and various dimensions of one’s environment can facilitate learning of abstract skills that may be used subsequently in related tasks. Using Bayesian network structure-learning techniques and structured dynamic programming al...

متن کامل

Effective Control Knowledge Transfer through Learning Skill and Representation Hierarchies

Learning capabilities of computer systems still lag far behind biological systems. One of the reasons can be seen in the inefficient re-use of control knowledge acquired over the lifetime of the artificial learning system. To address this deficiency, this paper presents a learning architecture which transfers control knowledge in the form of behavioral skills and corresponding representation co...

متن کامل

Evolving Childhood’s Length and Learning Parameters in an Intrinsically Motivated Reinforcement Learning Robot

The capacity of re-using previously acquired skills can greatly enhance robots’ learning speed and behavioral complexity. ‘Intrinsically Motivated Reinforcement Learning (IMRL)’ is a framework that exploits this idea and proposes to build agents capable of solving several specific tasks by assembling general-purpose building-block behaviors (‘skills’) previously acquired on the basis of ‘intrin...

متن کامل

Skill Acquisition Via Transfer Learning and Advice Taking

We describe a reinforcement learning system that transfers skills from a previously learned source task to a related target task. The system uses inductive logic programming to analyze experience in the source task, and transfers rules for when to take actions. The target task learner accepts these rules through an advice-taking algorithm, which allows learners to benefit from outside guidance ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016